On the Properties of Bit String-Based Measures of Chemical Similarity

نویسنده

  • Darren R. Flower
چکیده

With the growth of interest in database searching and compound selection, the quantification of chemical similarity has become an area of intense practical and theoretical interest. One of the most widely used methods of measuring chemical similarity is based on mapping fragments within a molecule as bits within a binary string. We present empirical results which suggest that bit strings provide a nonintuitive encoding of molecular size, shape, and global similarity. Other results, this time statistical in nature, suggest that the observed behavior of bit string-based searches have a large nonspecific component. On this basis, we question whether bit string-based similarity methods possess all the features desirable in a quantitative chemical distance measure or metric and suggest that there are instances when they may not be the most appropriate tool for searching or segregating chemical structures.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Similarity and Dissimilarity Methods for Processing Chemical Structure Databases

This paper reviews measures of similarity and dissimilarity between pairs of chemical molecules and the use of such measures for processing chemical databases. The applications discussed include similarity searching, database clustering and diversity analysis, focusing upon measures that are based on fragment bit-string occurrence data. The paper then discusses recent work on the calculation of...

متن کامل

Drill string Vibration Modeling Including Coupling Effects

Abstract: The governing equations of motion for a drill string considering coupling between axial, lateral and torsional vibrations are obtained using a Lagrangian approach. The result leads to a set of non-linear equations with time varying coefficients. A fully coupled model for axial, lateral, and torsional vibrations of drill strings is presented. The bit/formation interactions are assumed ...

متن کامل

An Empirical Comparison of Distance Measures for Multivariate Time Series Clustering

Multivariate time series (MTS) data are ubiquitous in science and daily life, and how to measure their similarity is a core part of MTS analyzing process. Many of the research efforts in this context have focused on proposing novel similarity measures for the underlying data. However, with the countless techniques to estimate similarity between MTS, this field suffers from a lack of comparative...

متن کامل

Evaluation of Similarity Measures for Template Matching

Image matching is a critical process in various photogrammetry, computer vision and remote sensing applications such as image registration, 3D model reconstruction, change detection, image fusion, pattern recognition, autonomous navigation, and digital elevation model (DEM) generation and orientation. The primary goal of the image matching process is to establish the correspondence between two ...

متن کامل

An Estimation of Required Rotational Torque to Operate Horizontal Directional Drilling Using Rock Engineering Systems

Horizontal directional drilling (HDD) is widely used in soil and rock engineering. In a variety of conditions, it is necessary to estimate the torque required for performing the reaming operation. Nevertheless, there is not presently a convenient method to accomplish this task. In this paper, to overcome this difficulty based on the basic concepts of rock engineering systems (RES), a model for ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of Chemical Information and Computer Sciences

دوره 38  شماره 

صفحات  -

تاریخ انتشار 1998